DON’T SKIP THIS AND GO STRAIGHT TO THE DATA!
This is an analysis of Big Five Personality Test results. The raw data was collected by Open Psychometrics, containing over 1 million responses from hundreds of different countries. Data was collected with consent from the user.
If you’re interested in a specific part, you may skip ahead:
Introduction
Data
Further Analysis
After looking at the raw data, a few questions stood out to me:
Questions 1 through 3 are pretty data-based as the answers lie somewhere within the raw data, allowing me to hone my skills of cleaning, manipulating, and visualizing data. Question 4, on the other hand, is far more speculation and understanding based, where I draw from my experiences and knowledge to form an answer. Together, they make this project an enjoyable challenge to tackle.
In regards to Question 5, I think this project will also allow me to explore the Big Five personality model based on my own interests. I want to see the underlying psychological principals and insights the Big Five provides, and how I can apply it to my daily life. So, I want to also spend some time analyzing the system, and understanding the concept as a whole.
But, let’s take a detour from answering the above questions and establish the following:
The Big Five is a personality model that categories people with 5 major personality traits:
These five major traits can be broken down further. For example, extroversion contains ‘minor traits’ like gregariousness, assertiveness, and excitement-seeking. These traits help you better understand what this test analyzes in regards to ‘extrovertedness’, but this detail is beyond the scope of this investigation. More information can be found here.
To get your results, you determine which ‘letter’ you are for each category (e.g. If you’re more social than reserved, then you’d say that you’re an ‘S’)1 and combine your letters from all the categories together. For example, a possible result is RLOEN, standing for reserved, limbic, organized, egocentric and non-inquisitive. This can be seen by looking at the first letter of each trait.
However, if you really can’t decide what you are, another possibility is replacing letters with an ‘X’. For example, this can be seen in the potential combination of SXUAN. With the ‘X’ in the neuroticism (2nd) slot, it means that they’re in between calm and limbic as they exhibit both traits equally and there is no ‘conclusive’ answer. However, it is extremely likely for people to identify with one side slightly more than the other, so having ‘X’ (denoting that you’re perfectly in between) should not be a common occurrence.2
I’m using the SLOAN formatting, (i.e. Extroversion is represented by the 1st letter and Openness is represented by the 5th letter) and will use the above wording (i.e. Agreeableness, Neuroticism) to describe the Big Five traits throughout the investigation. If you decide to do your own research, you’ll likely see others using the CANOE or OCEAN formatting/use different words to describe the traits.
Interested in what your results are? The Open Psychometrics Big Five quiz can be taken here.
Citations
This quiz is self-administered, with 1,015,342 respondents answering 50 questions over ~2 years. Since we have 50 questions, this means that there were 10 questions from each of the majour personality traits.
Every question was ranked on a Likert scale (a 5 point scale) with 1 meaning “I disagree with the prompt”, 3 meaning “I am neutral on the prompt”, and 5 meaning “I agree with the prompt”.
Some sample prompts include:
You can find all the questions here, if you look at all the 10-item scale questions.
Questions from each category were rotated, with questions following the pattern of Extroversion Q1, Agreeableness Q1, Conscientiousness Q1, Neuroticism Q1, Openness Q1, then moving onto Extroversion Q2, Agreeableness Q2, etc.
Next: Explore the Data
Note that you’re only looking for which side you identify with more. You’re never ONLY going to be social or ONLY reserved. Human behaviour exists on a spectrum.↩︎
You’ll actually see a surprising number of respondents have an ‘X’ in their results in this data set. However, I really do believe that people always lean more to one trait more than another, and you’ll only receive an ‘X’ if you take a self-administered quiz – we’re trying to compute humans and that’s never a holistic judgement.↩︎